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Abstract 

We introduce the blind index coding (BIC) problem, in which a single sender communicates distinct messages to multiple 
users over a shared channel. Each user has partial knowledge of each message as side information. However, unlike classic index 
coding, in BIC, the sender is uncertain of what side information is available to each user. In particular, the sender only knows 
the amount of bits in each user’s side information but not its content. This problem can arise naturally in caching and wireless 
networks. In order to blindly exploit side information in the BIC problem, we develop a hybrid coding scheme that XORs uncoded 
bits of a subset of messages with random combinations of bits from other messages. This scheme allows us to strike the right 
balance between maximizing the transmission rate to each user and minimizing the interference leakage to others. We also develop 
a general outer bound, which relies on a strong data processing inequality to effectively capture the senders uncertainty about 
the users’ side information. Additionally, we consider the case where communication takes place over a shared wireless medium, 
modeled by an erasure broadcast channel, and show that surprisingly, combining repetition coding with hybrid coding improves 
the achievable rate region and outperforms alternative strategies of coping with channel erasure and while blindly exploiting side 
information. 


I. Introduction 

In many communication scenarios, users have access to some side information about the messages that are requested by other 
users. For example, this scenario can arise in caching networks in which caches opportunistically store content that may be 
requested in the future. It can also arise in wireless networks in which nodes can overhear the signals intended for other nodes 
over the shared wireless medium 13. However, since there are many possibilities for what each cache can store at a particular 
time (or for what signals each node can overhear in a wireless network), tracking the exact content of side information at the 
users can be very challenging. Therefore, it is more suitable to require the server only track the “amount” of side information 
at each user, and not its exact “content”. Consequently, a natural question is; how can a sender take advantage of knowledge 
of only the amount of side information to efficiently deliver messages to users? 

To understand this problem, and evaluate and isolate the ultimate gain of such side information, we introduce a basic network 
communication problem with one sender and several users, depicted in Figure The sender communicates a distinct message, 
Wi, to each of K users (labeled i = 1,... ,K) over a broadcast communication channel, while each user, i, has some prior 
side information {(pij) about other users’ desired messages {wj where j ^ i) that it may use to assist in decoding its own 
desired message. However, the sender does not know the precise side information given to each user (i.e., the sender is blind), 
and it must employ a transmission strategy that only uses knowledge of the probability distributions of for all i ^ j. 

We refer to this new formulation as the blind index coding (BIC) problem. Our formulation is a generalization of the 
classic index coding problem SSI, which is a canonical problem in network communication theory and, despite its simple 
formulation, remains a powerful tool for analyzing many network communication settings (see e.g., cHni). The key difference 
in BIC problems lies in the sender’s uncertainty regarding side information: In classic index coding, precise knowledge of side 
information is used by the sender to create transmission strategies that treat message bits differently depending on whether they 
are within or not within side information at each particular user omni. However, in BIC the sender is unable to distinguish 
between such message bits, and thus transmission must “blindly” exploit knowledge of the only the amount of side information. 
As we will see, this minor difference significantly changes the technical challenges of the problem. 

The main question that we investigate in this paper is “To what extent and using what techniques can we blindly exploit 
such side information?” To that end, after formally introducing the BIC problem, our hrst contribution is the development of 
a class of hybrid coding schemes, which XOR random linear combinations of bits from one subset of messages with uncoded 
bits from a disjoint subset of messages. In these hybrid codes, the sender XORs uncoded bits in order to probabilistically 
exploit side information already available at users. We first provide an example to show that this approach can outperform 
random coding and in fact sometimes achieve capacity. We then construct an general achievable scheme for three users based 
on this approach and determine the achievable symmetric rate. 

In order to evaluate the efficacy of our scheme, as well as to gain further intuition beyond three users, our second contribution 
is the development of a new outer bound on the capacity region. An essential aspect of our outer bound is its utilization of a 
strong data processing inequality M which captures the inability of the sender to distinguish between bits of a message known 
or unknown to a given user prior to transmission. We demonstrate that our converse is tight in two special cases: namely, the 
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award N000141310094, and research grants from Intel and Verizon via the 5G project, and was completed while D.T.H. Kao was pai1 of the Universtiy of 
Southern California. D.T.H. Kao is now a part of Google Inc. Parts of this work were presented in Q and |2|. 
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Fig. 1. A K-user blind index coding problem (e.g., K = 3) depicted as a sender-user network with user caches. User i, for i = 1, 2, 3, desires message Wi 
and may use side information about other messages to facilitate decoding; <pij denotes the side information that User i has about Message Wj. The sender 
only has knowledge of the distribution of ^ij, and not its precise realization. Notice the amount of side information available may vary across users and 
messages. 


two-user and symmetric itT-user BIC (where all users have the same amount of knowledge about undesired messages). In both 
cases a simple achievable scheme based on random coding suffices to achieve the entire capacity region. As we move beyond 
these special cases to the general BIC setting, we confirm that, at least for some problem settings, our three-user hybrid coding 
scheme can meet the symmetric capacity upper bound. Finally, we numerically evaluate our new hybrid coding scheme and 
outer bounds relative to existing achievable schemes. 

In our final contribution, we further consider the BIC problem in a wireless setting, specifically studying how lossy sender-to- 
user links can affect schemes to blindly exploit side information and the resulting achievable rates. Interestingly, we demonstrate 
that in addition to hybrid coding (where XORing uncoded bits of a subset of messages with random combinations of the others 
played a key role), quite surprisingly, XORing the same uncoded bits more than once (i.e., repetition of uncoded bits) can 
increase the achievable rate. Equipped with this observation, we then proceed to construct a coding scheme that leverages 
both hybrid codes and repetition of uncoded message bits in order to establish an achievable rate region, and we demonstrate 
numerically that such a scheme offers a strict improvement in achievable rate over conventional approaches. 

To summarize, the main contributions are as follows: 


1) We introduce the Blind Index Coding problem, which generalizes classic Index Coding by considering uncertainty 
(blindness) at the sender about side information given to users. 

2) We propose a class of hybrid coding schemes, which XOR random linear combinations from one subset of messages 
with uncoded bits of another subset. 

3) We derive a novel outer bound on the capacity region of BIC problems which leverages a strong data processing inequality 
to account for the blindness of the sender. 

4) We further generalize the problem to better model wireless settings by studying how lossy sender-to-user links affect the 
efficacy of the hybrid coding schemes, and we find that repetition coding can enhance the performance of hybrid codes. 


This remainder of the paper is organized in the following way. In Section we formally state the BIC problem first for 
error-free broadcast and then for broadcast over lossy channels. In Section |I^ we motivate both the ideas behind hybrid coding 
and our outer bound using a simple example, for which the inner and outer bounds meet. In Section IV we define a hybrid 
coding scheme and study the achievable symmetric rate for the three-user BIC, in Section |V] we state and prove the general 
outer bound for BIC problems, and in Section VI we numerically compare achieved rates to the derived outer bounds. In 
Section VII we consider blind index coding when the sender-to-user links occur over wireless channels. Concluding remarks 


and open questions are presented in Section VIII 


II. The Blind Index Coding Problem 

In this section, we formally define the Blind Index Coding problem by stating the network and side information models, 
and formalizing the notion of capacity. 

Network model: In a BIC problem, as shown in Figure K users each request a message from a sender; i.e.. User i, for 
i = 1,..., AT, desires the m^-bit message Wi, which is drawn uniformly from a space {0, 1}^\ Each user, i, has access to 
side information, fij, (whose form is described later) about each message Wj except the one it desires (i.e., for all j ^ i). The 
sender aims to communicate all messages to the respective users via a common error-free channel. The goal of the problem is 
to design a scheme that maps messages to a channel input vector, x, of minimum length, such that each user can decode its 
desired message. 

Side information model: In a blind index coding problem, each side information signal, fij, is a random fraction of the bits 
that make up the message, wj. We assume that the sender is “blind” in the sense that it is only aware of the average number 
of bits in each side information signal. 

More specifically, we can model the side information in the following way. Let fij be a length-TOj binary vector drawn i.i.d 
from a Bernoulli(1 — pij) distribution. Side information fij = [1], [2],... ,(j)ij[mj]) is such that, for £ = 1,... ,Tnj, 


( 1 ) 
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User i knows gij for all j ^ i, however the sender is only aware of parameters, {nij}, which govern the probabilistic behavior 
of the side information. Note that the side information model is equivalent to either 1) randomly sampling bits of a message, 
or 2) passing a message through a side information channel which is an erasure channel. 

Remark 1. The key difference between the BIC problem formulation and a classic index coding problem of lH |6l lies in 
the uncertainty in message bits given as side information. Notably, if we consider the scenario where gij G {0,1} for all i,j, 
then side information availability is deterministic and known to the sender, and our formulation is identical to i). Thus, BIC 
generalizes classic index coding. 

Remark 2. Index coding problems with transformed and random side information were considered in m and ifThll respectively, 
but in both cases it was assumed that the side information is known to the sender. Another related problem is described in ifTTll 
where pliable users are considered; Users express no specificity in messages demanded and thus which messages to send are 
uncertain. Interestingly, in the cases of pliable index coding with known solutions, either canonical random coding or uncoded 
transmission strategies were sufficient. 

Capacity Region: We now consider a BIC problem with K users and side information parameters {giij} as defined above. 
For this problem, a {ri,r 2 , ■ ■ ■ jCk) scheme with block length n consists of an encoding function and K decoding functions. 

The encoding function, fenc ■ {0,1}", uses the knowledge of {/iy} for all j ^ i to map each of K 

messages (with message Wj consisting of rrij bits such that lim„_>,oo = Vj) onto a length-n binary vector, x, that is 
broadcast to all K users using n channel uses. We reemphasize that the encoding function relies only on the side information 
parameters, and not the side information signals, {4>ij}. 

The decoding function applied by User i, : (0,1}" x l}™^ x (0,1}™^ —>■ (0,1}™% maps the broadcast signal, 

X, as well as AT — 1 side information vector pairs, {4>ij,gij) for all j ^ i, to an estimate of its desired message, Wi. 

We say that a rate tuple (ri,..., tk) is achievable if there exists a sequence of (ri,..., tk) coding schemes with increasing 
block length, n, such that for every i G {1,..., K} 

lim Pr 

n—>-oo 

The capacity region is defined as the closure of the set of all rate tuples (ri,..., tk) that are achievable. 

The goal of this paper is to study the capacity region of the BIC problem. As we show later in Proposition the capacity 
region of a 2-user BIC problem is easy to characterize. Thus, in order to gain a better intuition on BIC problems beyond 2 
users, we focus in particular on the 3-user BIC problem. 

III. Motivating Example 

In this section we motivate both the proposed coding schemes and outer bound using a simple, concrete example. Consider 
a BIC with three users (i.e., AT = 3) and where Users 2 and 3 have full side information about other users’ messages, while 
User 1 only knows a third of each of W 2 and (i.e., gi 2 = p,i 3 = | and p .21 = A 23 = A3i = /^32 = 0)- We focus on this 
specific BIC problem because in this scenario the sender is blind only about side information at User 1 and therefore we can 
focus on the impact of blindness regarding just one user. 

For this particular BIC problem, we will determine the symmetric capacity (i.e., the maximum rate r such that ri = r 2 = 
ra = r is achievable) by assuming the lengths of all messages are the same (i.e., mi = m 2 = m 3 = m where m is large), 
proposing a scheme, and introducing a method to bound the capacity region. 

For the sake of comparison, we will first establish a baseline achievable symmetric rate by considering random coding, an 
often-used approach to coding in the presence of uncertain side information. For example, one natural scheme would be to 
send random linear combinations (RFC) of all message bits (i.e., parity bits to supplement side information) over the shared 
channel, until each user has a sufficient number of linearly independent equations (including side information) to decode all 
of the messages. For this example, by sending m(l -f /ri 2 + p. 13 ) -f o{m) — ^ + o{m) random parities, each user has at least 
3m -I- o{m) equations for 3m unknowns, meaning that with high probability each user can linearly decode all three messages: 
conventional random coding achieves Vsym = f FI 

Notice first that we can achieve better by first grouping 2 and 3 and sending each bit of W 2 XORed with a distinct bit of W 3 
(this requires exactly m transmissions). From these transmissions. Users 2 and 3 can use side information to remove the other 
message and decode their desired message. Then, by sending wi orthogonally in time (requiring another m transmissions) 
User 1 receives its desired message. In other words by treating subsets of messages differently, we achieved rgym = 5 - 

We now demonstrate how to further improve the transmission strategy by constructing a “hybrid coding scheme” using a 
combination of uncoded bits and randomly coded parities to go beyond the rate of i. In these hybrid schemes, during each 
phase of transmission a subset of messages are randomly coded, and then these are XORed with uncoded bits from another 
disjoint subset of the messages. For this example, we only require two such phases. In the first phase, each channel input is 


^ Wi 


= 0 . 


( 2 ) 


*In the subsequent explanation, we omit the o(m) to simplify the exposition. 
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Fig. 2. Illustration of the symmetric-capacity-achieving scheme of the example. The horizontal axis provides scale representation of the number of channel 
uses dedicated to each phase. Transmission type is illustrated using outlined (uncoded) or shaded (randomly coded) Wocks. Notice that, because the sender is 
blind, parts of messages W 2 and W 3 that are known to User 1 cannot be explicitly aligned as discussed in Remark 3 and thus some parts of is XORed 
with as well as some of is XORed with These are displayed as contiguous blocks in the figure for clarity, but in reality would be interleaved 
throughout the first m channel uses. 


generated by XORing a random combination of wi bits, a single uncoded W 2 bit, and , a single uncoded bit. Each uncoded 
bit from both W 2 and W 3 are used only once to generate an input, and thus the first phase consists of exactly m channel inputs 
generated in this manner. Formally, for each £ = 1,... ,m, the sender broadcasts c[£\^ wi © W 2 [£] © tU 3 [f], where c[£] is a 
length-m i.i.d. random binary vector. In the second phase, we send ^ RLCs of only wi bits. 

Notice that with this scheme, if each user decodes its desired message with error probability vanishing as m grows large, we 
achieve rate of rsym = ^ which is higher than the | achieved through conventional random coding and ^ achieved through 
grouped random coding. We now explain why with this scheme such a rate is achievable by explaining how each user decodes 
its desired message; 

User 1: Notice that during the first phase, for each channel input £ € m}, there is a probability of (l — /J-i 2 )(l—£ii 3 ) = 

I that User 1 knew both W 2 [£] and W 3 [^]. In such an event. User 1 can cancel W 2 [£\ © and received a “clean” 
RLC of wi bits. Therefore, during the first phase User 1 receives (approximately) ^ such RLCs. In the second phase 
we supplemented this with an additional ^ RLCs of only wi. When combined, at the end of transmission User 1 
will be able to identify in total m linearly independent equations describing the m desired bits of wi. 

User 2: User 2 already knows all of wi and and therefore can remove their contributions from each channel input of 
the first phase. Thus, after canceling the undesired message contributions. User 2 receives exactly the m bits of W 2 . 

User 3: User 3 already knows all of wi and W 2 and therefore can remove their contributions from each channel input of 
the first phase. Thus, after canceling the undesired message contributions. User 3 receives exactly the m bits of W 3 . 

The key intuition on why we XOR uncoded bits of some messages with RLCs of others is as follows. Assume our objective 
is to create an input signal such that User 1 can use side information to cancel “interference” from undesired messages]^ W 2 
and W 3 . As m grows large the probability that User 1 can cancel a random combination of W 2 or W 3 vanishes, and thus RLCs 
of W 2 and W 3 almost surely add to interference that cannot be canceled. However, by XORing uncoded W 2 and bits, the 
probability that User 1 can exploit side information to cancel interference remains constant regardless of m. It is worth noting 
that while Users 2 and 3 eventually know all three messages (through side information and decoding their desired messages). 
User 1 ends up knowing only parts of messages W 2 and w^. 


Remark 3. To obtain further intuition, we can also interpret the proposed scheme as a form of interference alignment. Let 
w^' and w~ for j = 2, 3 denote subvectors of Wi known and unknown respectively to User 1 via side information. Note that 
w~ may be thought of as the part of Wi that interferes with User 1 getting wi, and that it cannot be canceled. Additionally, 
notice that the lengths of wf and w~ are approximately y and respectively. 

First, consider what strategy the sender could use if it was not blind and could identify these subvectors. It could first send 
RLCs of W 2 and bits knowing that all such content cannot be cancelled by User 1. Then it could send RLCs of wi, 
and w'^ bits, knowing that User 1 can cancel the W 2 and contribution for every such input. Specifically, the non-blind 
sender aligns the bits User 1 can cancel (lu^ and tU;^), as well as the bits it cannot cancel (lu^ and wj"). Via an equation 
counting argument, it is easy to verify that such a scheme achieves a higher rate of | 

When the sender is blind, it is unable to distinguish from and from . However, we would still like to 

efficiently send both W 2 and W 3 simultaneously, and therefore our scheme achieves such alignment probabilistically by using 
uncoded bits from W 2 (W 3 ) to preserve the separation between and (w^ and w^). Figure highlights the two desired 
interference alignment cases, as well as the transmissions where alignment fails due to the sender being blind. 


Remark 4. Our scheme’s probabilisitic alignment is obviously less effective than the explicit alignment that is possible when 
the sender knows the side information. This loss of effectiveness is particularly well captured by the amount of additional 

^We focus on User I’s ability to cancel contributions of other messages, since by assumption User 2 and 3 have full knowledge of undesired messages and 
can cancel any such interference perfectly. 
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interference incutTed by our scheme, and thus any attempt at a converse must capture the amount of interference incurred as 
a result of blindness. 

Notice that in both schemes, because we must send all of W 2 to User 2, we incur at least ^ bits of interference from wf. 
In the case of the non-blind sender, one can show that this is all the interference that is incurred at User 1, because the sender 
can fully align the interference caused from the messages wf and wf at User 1. On the other hand, because the sender is 
blind, our scheme incurs an additional | x | x m bits of interference. In Section we present a general outer bound that 
shows that this additional interference is indeed unavoidable and hence the scheme presented above is indeed the best possible 
in this example. More specifically, we will prove a generalization of the following inequality: 




1^3 ) + 


(3) 


The above inequality lower bounds the interference at User 1 (i.e., the unknown parts of W 2 and W 3 ) with a convex 
combination of terms that either represent providing none of W 2 as side information {H{x\wi,w^) or all of W 2 as side 
information (H{x\wiW 2 ,w^)). The coefficient weights that describe the combination are a function of the side information 
parameter /r = |. A more general form of this inequality is the key lemma used to construct the outer bound. 


Before concluding the section, we point out that this inequality is valid only when the sender is blind. Indeed, if we consider 
the non-blind sender like in Remark the correct inequality would be 

2 

H[x\wi,w^,w^)> -H[x\wx,w^), (4) 

which is clearly looser than Q. Note that the additional term that appears in ([^ but does not appear in Q captures additional 
interference due to blindness of the server. 


IV. Achievability 

In this section, we study achievable rates in the BIC problem. As alluded to in the motivating example, one possible approach 
to dealing with blind side information at users is random coding. For example, in a standard random linear code applied to 
binary message sequences, each channel input is created by XORing random linear combinations of all bits from all messages. 
In the rest of the paper, we refer to this approach as conventional random coding. Using conventional random coding requires 
that all users decode all messages, thus rate tuples are achievable if and only if they satisfy, for every i G {1,..., K}, 

+ <1. (5) 

In some cases, this suffices to achieve the full capacity region. For example, we will see that the rate region achievable by 
conventional random coding in the following two scenarios exactly matches the outer bounds derived in the next section]^ 

• 2-user BICs, for any value of fii 2 and ^ 21 - 

• Symmetric iT-user BICs, where fj,ij = /i for all i ^ j. 

However, as demonstrated in the previous example, conventional random coding is not optimal in general, and in the rest of 
this section we propose a new hybrid encoding strategy that XORs random combinations of all bits from some messages with 
uncoded bits from others. This hybrid between random coding and uncoded transmission is the key mechanism to blindly 
exploit side information. For simplicity, we focus on symmetric rates achievable in an arbitrary 3-user BIC. We will first state 
the achievable symmetric rate as a theorem, then describe the encoding and decoding strategies before finally proving that the 
symmetric rate claimed in the theorem is indeed achievable. 


A. 3-user BIC Hybrid Coding 

We now state the 3-user symmetric rate achievable using hybrid coding. We then define a hybrid encoding scheme for 3-user 
BIC problems, using key points from the motivating example. 

Theorem 1. Consider a 3-user BIC problem, defined by parameters {/iy }, where WLOI^user indices are such that 

Ai32 </i23 < niax{pH,^ii}, (6) 

for either i G {2,3}. Any satisfying the following is achievable: 



< max 


11 + M23 + M 12 + /^ 13(1 — B23 + M 32)(1 — ^12) ’ 1 -f ^12 + /ri 3 j 


(7) 

( 8 ) 


^Theorem Instates the outer bound while the capacity regions for the two scenarios are formally stated as Propositionsandrespectively. 
“^For any three users, such a condition must hold for at least one permutation of indices. 
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Fig. 3. Hybrid coding scheme for 3-user BIC, where fii 2 = /r .13 = ^21 = ^23 = 

boxes represent RLCs of a single message. 


^31 = ^32 = -r. Outlined boxes represent uncoded bits, shaded 


Remark 5. Consider r^m satisfying Q and (|^. In the right hand side of if the second term within the maximization 


is larger, then (7 1 and (8 


simplify to rsym < min < 


1 1 1 
1 + A»21+A»23’ l + r‘31+r‘32’ 1+M12+M13 


. In this case, from (5 i it is clear that 


conventional ranaom coding suffices to achieve the desired rate. Hence, our hybrid coding scheme increases the symmetric rate 
whenever the hrst term in the max of ([^ is larger. Additionally, since conventional random coding suffices when the second 
term is larger, to prove Theorem [T] we need only to describe a scheme and prove achievability for rsym satisfying 


rsym < min 


_^^^_ 1 

1 + ^-21 + 1^23 1 + + ^-32 1 + ^-23 + 1^12 + Ml3(l ” ^^23 + M32)(l — ^ 12 ) j 


(9) 


We now dehne our hybrid coding scheme where, for any rsym satisfying (j^, the sender will communicate m = nr gym — Sn 
bits where (i5„ is chosen such that Sn = o(n)) to each user in n channel uses, such that probability of eiTor vanishes as n goes 
to inhnity|^ 

Encoding: The hybrid coding scheme is characterized by three parameters, Ni, N2, and N^. For each i G {1,2,3}, we 
generate Ni random linear combinations (RLC) of the bits only in Wi, denoted by vector J^. The precise values of Ni, N 2 , 
and A 3 are specihed later, however we point out as depicted in Figure that Ni — m > N 2 > N 3 . 

As shown in the hgure, the sender combines RLCs and uncoded bits of messages in hve phases. During Phase 1, each 
input is the XOR of one bit from each of Ji, J 2 , and J 3 , where we take bits from each vector sequentially. Phase 1 ends and 
Phase 2 begins when the bits in J 3 are exhausted (i.e., after N 3 channel uses). Similarly, the number of channel uses allocated 
to each phase of transmission are dictated by when we exhaust the bits of a certain type: Phase 2 inputs consist of an XOR 
of Ji, J 2 , and W 3 bits, and ends when we have no more bits from J 2 ; Phase 3 inputs consist of an XOR of Ji, W 2 , and W 3 
bits, and ends when we have no more bits from w^, Phase 4 inputs consist of an XOR of Ji and W 2 bits, and ends when we 
have no more bits from W 2 ', and Phase 5 inputs consist of only a Ji bit. 

Decoding: We now describe the decoding scheme of each user. Users 2 and 3 each decodes all 3 messages. As in conventional 
random coding, this requires that User 2 and 3 each receive a sufficient number of independent linear combinations of messages 
bits, either via side information or the shared channel. 

A key point in our coding scheme lies in how User 1 exploits the hybrid coding structure to decode wi. As in the example. 
User 1 uses side information to cancel out the combinations of known W 2 and bits from symbols received in Phases 3 and 
4. It uses these “clean” RLC of only wi bits along with those RLC received during Phase 5 to linearly decode only wi. 

For the scheme to achieve rgym (Fe., in order for decoding error probability to vanish as n grows large), we claim that 
choosing A^i, N 2 , and as 


Ni=n and N 2 = nrsymi^23 and = nrsymd'32, (10) 

results in a probability of decoding error that vanishes as n —> c». We prove this formally in the following subsection. 

Remark 6. Recall from the illustrative example, we wanted to maximize the chance that User 1 can clean W 2 and content 
from a transmission, and we assumed that both User 2 and 3 decode all three messages. Thus the phases of transmission in 
Figure]^ have the following roles: Phase 5 provides RLCs about wi to User 1. Phase 5 also provides enough wi RLCs for 
each of User 2 and User 3 to decode wi (with the help of their side information). A fraction, (1 — ^ 12 ), of Phase 4 is useful 
to User 1 after using side information to clean the W 2 component, to obtain a clean RLC of only wi. Similarly, a smaller 
fraction, (1 — pi 2 )(l — Mi 3 )> of Phase 3 is useful to User 1 by cleaning both the W 2 and W 3 components, to obtain a clean 
RLC of wi. Note that User 1 only uses clean RLCs from Phases 3-5 to decode wi. Because Users 2 and 3 each decoded wi 
from Phase 5, each cancels out the wi component from Phases 1-4, and then each uses the remaining residual symbols to 
decode both messages W 2 and ws. 


B. Proof of Achievability 

We now address the achievability of rate rgym satisfying 0 , using the hybrid network codes we just dehned. Before 
proceeding we recall that the message size m is such that m = nr — 6 n, where Sn is positive and Sn = o{n). 

^The o(n) term 1) accounts for the fact that m must be integer, and 2) as we shall see, ensures that decoding error will vanish as n grows large. 
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To prove that the rate is achievable, we must show that the probability that any user does not decode its desired message 
vanishes as n —oo (i.e., Pr[zi;i ^ w^] —^ 0). Moreover, since our decoding strategy requires that User 2 and User 3 decode 
all three messages, we also show that the probability of decoding error of all messages at Users 2 and 3 vanishes as n —?> oo. 
Specifically, we have the following possible error events, each of which must approach 0 as n —oo: 

£i: User 1 fails to decode wi. 

El'- User 2 fails to decode {rui, W 2 , W 3 }. 

Ez- User 3 fails to decode {wi,W 2 ,W 3 }. 

For each event we will separate the error analysis into different sources of error, and for each source of error we will 
use one of two analysis techniques to prove that the probability of such an event occurring vanishes with large n. In order 
to provide clarity, and since User I’s decoding strategy was the primary difference between hybrid coding and conventional 
random coding, we will revisit these two techniques after hrst applying them in the context of analyzing the probability of 
event Ei- 

Recall that User 1 first uses its side information to “clean” transmissions from Phases 3 and 4 resulting in random linear 
combinations (RLCs) of only bits from wi. It then combines clean RLCs with those received during Phase 5 (recall from 
Figure]^ that Phase 5 only has wi content) and attempts to linearly decode wi. Therefore, we express User Us decoding error 
as the union of two events, Ei = Eia U En,, defined as: 

Eia'- The total number of random linear combinations (RLCs) cleaned from Phases 3 and 4 and received in Phase 5 is 
less than m + S^, where grows with n and 0 < < Sr,. 

£ib'- The random matrix that describes the transformation of wi to received (clean) RLCs is rank dehcient. 

We will now proceed to show 

Pr[£:i] = Pr[£:ia U Eu] = Pr[£:ia] + Pr[£^f„ C Eu] = o{n). 

We first address Pr[fia]. By the scheme’s construction: 

• Phase 3 has duration m — N 2 + -/V 3 , and the probability of cleaning each transmission is (1 — /ii 2 )(l — Mia)- 

• Phase 4 has duration N 2 — N 3 , and the probability of cleaning each transmission is (1 — 1112 )- 

• Phase 5 has duration Ni — N 2 — m, and each transmission is a clean RLC of wi. 

We may thus represent receiving a clean RLC in the f-th channel use of Phase 3 as a Bernoulli(l — /ii2 — /ii 3 + M12M13) 

random variable which is i.i.d. across £ = 1 ,... ,m — N 2 + and receiving a clean equation in the £-th channel use 
of Phase 4 as a Bernoulli(l — ^ 112 ) random variable A 4 [£] which is i.i.d. across £ = 1,..., A ^2 ~ -A^ 3 - We now note that the 
duration of Phases 3 and 4 (D^ and D4) are by construction, 

-D 3 = TO - A^2 + -/V 3 = nrsymi^ - M23 + M32) - Sn, 

D4 = N2 — = nrsymitJ ‘23 — M32), 

and that the duration of Phase 5 may be bounded as 


z?5 = Ni — N 2 — TO 

= n - nrsymi^ + M 23 ) + Sn 

(a) 

> nrsym{l + /i23 + M 12 + Ml3(l - M 23 + M32)(1 - /I 12 )) - nrsymi^ + M 23 )) + S„ 
= nrsym[y-12 + Ml3(l - M23 + M32)(1 - /ri2)) + Sn 

= nrsymi). ~ (1 ~ M 23 + M32)(1 — /il 2 )(l ~ M 13 )) ~ (M23 ~ M32)(1 — ^ 12 ) + Sn 

> nrsym - D‘i{l - Mi 2 )(l - M 13 )) - - 04(1 - P 12 ), 

where step (a) results directly from (|^. From this, the probability of Eia occurring is, in the limit, 

' 03 -D 4 \ 

^3[^] + y]] A4[£'] + II5 j < m-\- Sn 


( 11 ) 


lim Pr[£ia] = lim Pr 


(b) 

< lim Pr 

n—^oo 


= lim Pr 

n—>-oo 


= lim Pr 

n—foo 


l=\ 

Dr 


t' = l 
Di 


A 3 [£] + y] A4[£'] + nrsym - - 03(1 - Mi2)(l - M13)) - - 04(1 - M12) <m + S„ 


' D 3 


e'=i 


y]A 3 [£]-E 

e=i 

D 3 

y]A 3 [£]-E 


Dr 






D 4 


e=i 


EA4[£]-E 

e '^1 

D 4 

EA4[£]-E 


Dr 


•' = 1 


E A 3 

.e=i 

D 3 

E A 3 


+ nrg 


< m + Sr 


e=i 


+ TITsym I ^ tit sym Sn Sn 


















< 


lim Pr 

n—>-oo 


8 


\( 

E£iA3[€]-E 

E?iiA3[^]' 




1 - 

/ 

s 

1 

_1 


n 



n 


1 n 



where in (b) we applied the bound (111 while noting that if a > 6 then Pr[a < c] < Pr[& < c], and in (c) we invoked the law 
of large numbers while noting that 6 n — is positive by construction. 

Now consider the event n Eib- This describes the case where User 1 receives enough (i.e., m + S^) clean equations 
but the randomly generated matrix that maps wi to clean equations has rank less than m. This type of error is well studied 
throughout the network coding literature. For instance, from expression (3) of IISl, the probability of a m x m + random 
binary matrix having rank less than m can be bounded as 

Pr(£-^,n£-i,)<2-^^, 


which implies, as desired. 


lim U £ib) = 0. 

n—>-oo 


We now revisit the analysis and note that £ia may be thought of as the event where the actual amount of randomly available 
side information was “not enough” because it deviated significantly from the mean. On the other hand £^^ n £ib describes the 
case where there was a sufficient amount of side information, but the randomly generated coding scheme failed to communicate 
the remaining desired message bits. For each type of error we applied a different analysis technique. To address the first, we 
applied a concentration inequality to show that the probability that the amount of randomly available side information deviates 
significantly from the mean vanishes as n grows large. To address the second, we applied existing analysis on the properties 
of randomly generated matrices to show that the probability of a rank-deficient encoding matrix vanishes as n grows large. 

To prove that the probabilities of error events £2 and £3 also vanish as n grows large, we must systematically break down 
these error events into subevents of these two types. Since these two users apply the same decoding process, we now focus 
on User 2 , and we identify such subevents. 

Recall the User 2 first uses its side information and Phase 5 transmissions (i.e., RLCs with only wi content) to decode wi. 
Let £ 2.1 be the event where User 2 fails to decode wi which we further breakdown into the following subevents, £ 2.10 and 
£ 2 , 16 : 

£ 2 Sa- User 2 does not receive enough side information, i.e., I^p 2 i < where 6 ^ grows with n and 

0 < < Sn- 

£ 2 .ib' The random matrix that describes the transformation of wi to Phase 5 RLCs is rank deficient. 

One can verify using the same methods as in the analysis of £1 that the probability of either £210 or £^ 3 ^ n £210 occurring 
vanishes with large n as long as the rate rgym satisfies •El- 

Next, recall that after decoding wi, User 2 removes wi content from Phases 1^, and proceeds to decode both W 2 and W 3 . 
Let £2,{2,3} denote the event where User 2 fails to decode {'W 2 ,W 3 }, and we now study specifically £2 1 H £2,{2,3}- Notice 
that by construction, after the wi content has been removed. User 2 will receive some uncoded bits of W 2 from Phase 4. 
Furthermore, User 2 can also use its side information to clean the W 3 component from some transmissions during Phase 3 to 
receive more (independent) uncoded bits from W 2 - Noting this observation, we can now specify the final two error events for 
£2 analysis; 

£| 1 n £ 2 ,{ 23 }a- After decoding wi and removing its content from Phases 1^, the number of bits about W 3 User 2 learns 
from side information and the number of clean uncoded bits about W 2 User 2 learns from Phases 3 and 4 is significantly 
less than the mean. 

£| 1 n £ 2 _{ 23 }f): After decoding wi and removing its content from Phases 1^, the random matrix that describes the 
transformation of W 2 and W 3 to transmissions in Phases 1 and 2 is rank deficient. 

Again, one can verify using the same methods as in the analyses of £ 1 ^ and £if, that the probabilities of these events occurring 
vanish with large n as long as the rate rgym satisfies (j^. Using the analyses of these subevents, we have 


lim Pr [£ 2 ] = lim Pr [£ 2 , 1 ] + Pr [£| ^ n £2,{23}] 

n—^oo n—>-oo ’ 

= lim Pr [£ 2 ,la] + Pr [££ia H £ 2 ,ib] + Pr [£|_3 n £ 2 .{ 23 }a] + Pr 


n—>-oo 

= 0 . 


'^2,{23}a ^2,{23}b 


Through similarly identifying subevents of £3 we can also establish that Pr[£3] —0 as n —> 00. Therefore, the probability 
of decoding error at each user vanishes as n grows large. □ 
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V. Outer Bound 

In this section, we present an outer bound on the capacity region of the BIC problem. We will hrst state and prove the 
bound for the 3-user setting and remark on its implications. We then introduce a key lemma and prove the 3-user outer bound. 
Finally we state a general expression for an outer bound on the general iF-user BIC capacity region. Its proof is relegated to 
Appendix 


A. 3-user Outer Bound 

We begin by stating the following result: 

Theorem 2. Consider a 3-user BIC problem. Rates (ri,r 2 ,r 3 ) are achievable only if, 

\ltij ltkj\ Bjk 

1 Bkj 

for any i ^ j 7 ^ fc G { 1 , 2 ,3} and [af = max{a, 0 }. 


-f pik 


rk < 1 , 


( 12 ) 


Remark 7. If the sender is not blind (i.e., the side information is known), our BIC problem can be converted to an analogous 
classic index coding problem with each user, i, desiring four different messages whose rates sum to by and whose proportion 
are determined by pji and pkt for i f j f k. 

For this resulting classic index coding problem, using the coding techniques of mi, it can be shown that rate tuples 
(ri, r 2 , ra) satisfying, for alH 7 ^ j 7 ^ fc G {1, 2, 3}, 

Ti -f Pijrj pikPjkTk < 1 

are achievable. Notice that for some side information parameters (e.g., when /j ,23 = P 32 = 0 and pij > 0 for j = 2,3), the 
rates achieved by a non-blind sender can be greater than the BIC outer bound, ( [T^ . The key difference in expressions is the 
third term on the left side of ( 121 , which captures (at least partially) the capacity loss due to sender blindness. 


Remark 8. By evaluating Theorem and comparing with the condition for achievability using conventional random coding 
0 we arrive at the following result: 

Proposition 3. Consider a 2-user BIC defined by parameters pi 2 and p 2 i- The capacity region is the set of all rate pairs 
(ri,r 2 ) satisfying 


Tl + ^ 12^2 < 1 , 

B2iri + r2 < 1. 


(13) 

(14) 


Proof: The converse results from Theorem|^by letting i,j G {1,2}, fc = 3 and hxing = 0, while achievability is a 
result of evaluation of (|^. ■ 

B. Proof of Theorem 

To prove Theorem we start by stating and proving a key lemma: 

Lemma 4. Consider a BIC problem with side information parameters {pij}. Then, for any (ri,... ,rfc) scheme with block 
length n and any random variable V that is independent of Wj and fij with i j (but may depend on other messages and 
channel parameters), we have 


H 


> pijH (x|y) -f (1 - Pij)H {5c\wj, V). 


Additionally, if pkj < Rij where i j k, then 


H (x C) > (x|C) + (x fikj,gkj,v) . 

V / i — LLki i — ttki V / 


1 pkj 


1 Pkj 


(15) 


(16) 


Remark 9. Inequality (15 1 captures an intuition that can be illustrated through the following toy problem. Consider a scenario 
where the sender has 4 bits 61 , 62 , Ci, C 2 . It knows that User 2 knows Ci and C 2 already and User 3 knows bi and b 2 . On the 
other hand, the sender only knows that User 1 knows either bi or 62 (but not both) and either ci or C 2 (but not both) and that 
both of these uncertainties are the result of a (fair) coin flip. If the sender sends a single transmission such that both User 2 
and User 3 learn something new about 6 i,& 2 ,Ci,C 2 , what is the minimum probability that User 1 also learns something new? 

One possible transmission would be to send bi 0 Cl. In this case. User 2 learns ci and User 3 learns bi, and there is a 
75% chance that User 1 learns either bi, ci, or &i 0 ci. In comparison, we can evaluate (15 1 for the porposed transmission by 
letting i = 1, } = 2, fc = 3, pi 2 = P 13 = and assuming W 2 = [^i ^ 2 ], = [ci C 2 ], and V = (<^ 13 , 513 ). In doing so. 


we see that the right hand side of (15 1 evaluates to /ri 2 (l) 0(1 — Bi 2 ){pi 3 ) = signifying that the 75% chance of “leaking” 


information to User 1 is the lowest possible. 
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Notice that, as stated. Lemmadoes not assume decodability of any message. Moreover, it applies regardless of the number 
of channel uses, whereas the toy example assumed only a single channel use. Consequently, Lemma can be viewed as a 
powerful extension of the intution from the toy example to vector (i.e., coded) representations of message bits. 


Remark 10. One can note that if the transmitter is not blind, the sender can construct a signal that invalidates 4. For example, 
consider the scenario in Remark]^ but now assume that the sender is aware that User 1 knows bi and ci. The sender can now 
use this knowledge to send a single transmission bi 0 ci. One can easily verify that, for this one transmission, the left hand 
side now evaluates to 0, while the right hand side evaluates to i which violates the claim. Therefore, the inequality specifically 
captures the impact of a blind sender. 


Remark 11. Inequality (15i can more generally be interpreted as follows. Note that H{x\V) corresponds to the case that 
there is no side information about Wj provided in the conditioning, and H(x\V, w,) corresponds to the case that all of Wj is 
provided as the side information in the conditioning. Therefore inequality (15 i lower bounds iT(x|0ij , , V) with a weighted 

average of two extreme cases, where either none or all of wj is provided as side information. A similar interpretation holds 
for (161, where Wj is replaced with 


Proof: To prove Lemma we first define a virtual side information signal, cf', such that fij is a physically degraded 
version of </>'. To do so we also specify two channel state sequences, f and drawn i.i.d from two different Bernoulli 
distributions that take a values of zero with probabilities fi' and 6 = , respectively. The side information signals are 

constructed such that for f S rrij}. 


g'[i]wj[e] and gij[i]= 9'[i]g^[e], 


(17) 


which necessarily implies p' < 

We now establish a relationship between the virtual side information signal, f, and the degraded side information, fij, 
using a strong data processing inequality proven in lfl4l which states, for random variables U ^ X ^ Y, that form a Markov 
chain. 


where 


I{Y-U)<s*{X-Y)I{X-U), 
s*{X,Y)^ sup 


QMDiQxWPxY 

and Qy is the marginal distribution of Y from the joint distribution Qxy = Py\xQx- We apply the strong data processing 
inequality by letting U = (x, V), X = (</)', (f), and Y — to show 


77 X 


(x 


t'lj 1 gij I ^) — I 

= -I 


I gij j ■ 


> - 


Vj +H{x\V) 
y,Py;x,U^ +H{x\V) 
s* (('?>^); ($ij^gij)) I 


H{x\V) 


( q ) 1 f^ij 


(l (<?', x|U) 0 / ^) ) + i? (S|T^) 

^^(i7(x|U)-i7(x|u,^',^)) +i7(x|U) 


Ptj - g 


l-M 


Yh (x|U) 0 (x|u, f^g') . 


(18) 


Recall that we only require g' < gij in 


Step (a), where we evaluated s* ; (^$ij,gij'^'^, is proven in Appendix 

order for the virtual signal to be properly defined, and we notice the following to^omplete the proof; 

• If /i' = 0, then {(j)',g') = {wj, 1) and we prove (15 i, 

• If p' = gkj < Pij, then {(j)',g') is statistically equivalent to {4>kj,gkj) and we prove (16i. ■ 

We now use Lemmato prove Theorem]^ First, we note that two side info parameter relationships affect the form of ( [T^ : 

The term (iLz—— k'jkY nr.n'7#irr. r^nU; if Kr.tli ^ !inrl /; ^ Tn tliic 


l-^kj 


Ti 0 Ptjrj 0 \^gik - 

Otherwise, if either gkj > Pij or gjk > gik, then 


is nonzero only if both gkj < Pij and gjk < gtk- In this case, 

{pij PkjYpik pjk') 


1 Pkj 


Tk < 1- 


(19) 


A 0 g^jrj 0 gikVk < 1. 


( 20 ) 
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We prove these two cases separately, and only address the first case, here. The proof of ( | 20 ) l will use similar techniques, 

and may be found in Appendix [B] We therefore assume fj,kj < ^J■ij and < ^ik, and start with Fano’s inequality at User i: 


jiTi < I {x,$ij,gij,$ik,gik',w^ +o{n) 


(l>ij j gij ^ gik^ H (l)2j ^ gij ^ 9ik^ “t“ o(7r) 


= H 
< n — H { X 


(a) 

< n — 


tUz , 4^ij : gij 1 gikj oi^Tl'j 
A 

f^ij l-^kj 


1 - 


f^kj 


N 1 _ 

H (^c\wi,(j)ik,g^k ) “ 

^kj 


where in step (a) we applied ( 16 i from Lemma 


$ik, gtk'^ 

4 by letting V = (^w^, ^,k, giky 



B 

/ 

_ \ 

H (x 

’^i 5 0k j : 9k j : 0ik ■> 9ik] 


( 21 ) 


( 22 ) 


Notice there are two negative entropy terms, 


A and B, to account for. To address the quantity A, we enhance side information at User j from ( (f>ji, gji j to Wi, and observe; 


TITj A I gji-! 4^jk^ 9jkt '^j ] “f 

< I 


= H 
{b) 

< H 


{x,Wi,$jk,gik',Wj^ +o(n) 

Wi,$jk,gjk'^ - H {x Wi,Wj,$jk,gjk^ + o{n) 


(S Wi,(j3jk,gjk'^ - 9jkH {x\wi,Wj) + o{n) 
.4 


< H X 


^x w^,$^k,gi^y -9jkH {x\wi,Wj) +o(n), 


( 23 ) 


where in (b) we used (15 i from Lemma while letting V = {wi,Wj), and in (c) we observe that, since gkj < g-ij and that 


9jk[^] — (* implies (fpjkli] — ^^9jk[^] = 0) is independent of Wi and x, replacing \4>jk,gjk] with 
effective conditioning (see Claim 


(wj,Wk) to find; 
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ypik ) 9ik^ 


reduces the 


in Appendix Cb. At User k we enhance side information from (<pkiTgkij'i^kj,gkj) to 


TkTk A I ^X, (j^ki ) gki ^ Tj)kj 5 9kj : ^0}^ A b){Tl) 

< I {x,Wi,Wj;Wk) + o(n) 

< H {x\wi, Wj) + o(n). 

To account for the quantity B, we observe 
ngikTk = TiTk - n(l - gik)rk 

< I (x,(fki,gki,$kj,gkj;Wk'^ - n{l - g,ik)rk + o{n) 

< I (x,w^,$kj,gkj,$ik,gik;Wky - n{l - g,ik)rk + o{n) 


( 24 ) 


= H 
= H 

(d) 

< H 


{^\Wi^ (^kj ^ gkj 1 bj^iki gik^ H ^X 

^X Wi , (j^kj : 9 kj 1 4 ^ik j 9 ik'^ ^ 


5 Ykj 5 


gkj,Wk'^ + I [$ik,gik\w,y - n(l - gik)rk + o(n) 


Wi,$kj,gkj,W}y + o(n) 


(^x Wi,$ki-,hi-,$ik,gi}y - gijH {x\wi,Wk) + o(n), 


/ 


H fx 

1 0k j 5 9k j 5 0ik ; 9'ik 1 


-gkjH {x\wi, Wk) + o(n). 
In step (d) we used ( [T6] l. Also, like ( 24 i, we find 

riTj < H{x\wi,Wj) + o{n). 


By appropriately scaling ([ 23 |, ( 24 i, ( 25 i, and ( 26 1, and then summing with 


arrive at (fTO]), as desired. 


( 25 ) 

( 26 ) 

□ 
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Fig. 4. Possible OBT for K = i users. Notice that the sequence of labels along each root-to-leaf path is a permutation of the user indices. 


C. K-user Outer Bound 

The construction of the bound is governed by a recursion specified using a tree data structure which we refer to as an outer 
bound tree: 


Definition 1 (Outer Bound Tree (OBT)). A AT-user OBT is directed labeled tree with K levels where each node in the first 
K — 2 levels has 2 children and each node in the K — 1-th level has one child. The label of the i-th node in level £ is denoted 
as v[i, i] S {1,..., K}, where if £ < K then z S {1,..., 2^~^} and if £ = K then i G {1,..., 2^~^}. The index i specifies 
the precise location in the level: nodes i = 2j — 1 and i = 2j in level £ < K are the left and right children, respectively, of a 
node j in level £—1. Node i in level K is the sole child of node i in level K — 1. Finally, the labels of an OBT must satisfy 
the following: 

1) For any path from the root node of the tree to any leaf node, no labels are repeated. 

2) Any two nodes with the same parent cannot have the same label. 


The first requirement is equivalent to saying that the sequence of labels along any path from root to leaf is a permutation 
of {1,..., AT}. This is demonstrated in Figure]^ where we provide an example of an 4-user OBT. 

We now state the following outer bound for the AT-user BIC: 


Theorem 5. Consider a K-user BIC with AT > 3, defined by parameters {pij}- The rate tuple (ri,..., tk) is achievable only 
if it satisfies, 

rA[i,i]<i, (27) 


for any K-user OBT, where 




Tb[£A 




+ CA iAA[£ +I,2i-1] + {1- CA i])rB[£ + 1, 2z] if£<K-l 

^f ^ Tf 1 , 

0 otherwise 


r [£-IAA\] + C[^, iAA[£ + 1, - 1] + (1 - C[A i\)TB[£ + 1, 2z] 

j Vv[.t,i] \f' r^l] “f C[^5 ^]^up-t-l,z] 

[o 


1 ~ Bv[l.+ l,2i],v[e.+ l,2i-l] 
Tlv[t+l,2i-l] *] 

0 


if£<K-l 

if£ = K-l' 
otherwise 


if£<K-l 
if£ = K-l, 
otherwise 


where we have, if £ > 1, 


r]j[£,i] 


and if £ = 1 


{“ 

[^min 1 , [21]} 


if j = v[£,i], i is odd 
ifj = v[£-l, [§]] 
otherwise 




1 iff = t;[l, 1] 

otherwise 


(28) 


(29) 


(30) 


(31) 


( 32 ) 
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Fig. 5. The OBT for Theorem|^ 


The proof of Theorem may be found in Appendix Here we remark on how the intuitions from Theorem are extended 
to K users. 

Remark 12. Consider the 3-user bound with respect to the more general statement of Theorem]^ Figuredepicts the exact 
assignment of labels for a 3-user OBT that results in Theorem S 

Recall that the construction of the outer bound in Theorem Hjbegan with applying Fano’s inequality at User i (i.e., the root 
node label of the OBT), and then applying ([T6]l of Lemma Applying ([T^ resulted in two terms A and B in (|22ll, each of 


which was canceled by analysis of a different user with enhanced side information. This is reflected in the first case of (28 i, 
where in addition to the rate of the user associated with the node label, we have the quantities r^[-] and Fb[-] associated with 
the expressions that will cancel A and B, respectively. The scaling terms C[(-A] reflect the appropriate scaling terms needed 
for the cancellation; e.g., consider the final step in the proof of Theorem where we took a weighted sum of (22 h26i. The 
last quantity, i], tracks the side information enhancement through each level of recursion. 

Remark 13. It is worth noting that the terms associated with the K — 1-th layer of the OBT are special: This layer represents 
the “base case” of the recursion, and in the 3-user scenario, we reached this base case after only one application of ( [T6] ). At 
the K — 1-th layer, instead of ([T^ we apply (15i which is reflected by the associated value of in 1301. 


Remark 14. By evaluating Theorem and comparing with the condition for achievability using conventional random coding 
Q we arrive at the following result: 

Proposition 6. Consider a K-user BIC where fXij = fifor all i ^ j. The capacity region is the set of all rate tuples (ri,..., tk) 
satisfying for every i € {1,..., K} 


Ti + 


1 #* 


. < 1 - 


(33) 


Proof: Achievability results directly from evaluation of Q. To prove the converse, we observe that when p,ij = p for all 
j i, for all f 


and if £ > 1, 




C[£,i] = 0, 

if j = v[£, i] and i is odd 
if j = v[£-l, [|]] 
otherwise 


(34) 


Evaluating recursively through the OBT yields 

1] = T„[ 14 ] + pry[2,2] + ■ • ■ Tfv[K,2^] ^ 1- 

Since the path from root to leaf is a permutation of user indices (i.e., all user indices are represented and there exist no repeats), 
we arrive at ([33). ■ 


VI. Numerical Results 

In this section we perform numerical analysis of inner and outer bounds to illustrate 1) the gain in achievable rate of hybrid 
coding over conventional random coding, and 2) the gap between our derived inner and outer bounds. 

To limit the scope of possible configurations (parameterized by /i^ terms), we focus on two symmetric scenarios for a 
representative set of parameters. In the first scenario, we consider side information that is “one-sided symmetric” (i.e., network 
parameters such that for all i y£ j k) while in the second, we consider side information that is “pairwise 

symmetric” (i.e., network parameters such that p,ij = p,ji for all i j). For each scenario, we will assume that, the size of 
side information at User 1 is the least and at User 3 is the most, and we plot the following: 
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1 ) 

2 ) 

3) 

4) 


The 3-user outer bound of Theorem applied to the symmetric rate. 

The achieved symmetric rate of the hybrid coding scheme described in Section IV 


The grouped random coding strategy (described at the beginning of Section IIIi wherein first, a sufficient number of 
random equations are sent such that Users 2 and 3 can decode W 2 and w^, and then wi is sent. 

The conventional random coding strategy (also described at the beginning of Section III i wherein a sufficient number of 
random equations are sent such that all users can decode all messages. 



amount of side information of the least knowledgeable user remains constant (i.e., /ii 2 and /ri 3 are fixed), the rate achieved 
by conventional random coding is constant across the range. 



(a) fJ .12 = M 13 = a, A‘21 = A ‘23 = 5, and ^si = M 32 = | 


0.5 

1 1 1 

1 1 1 

0.45 

( 

< 

( 

(1 

0 

0 

0 

1 




0.4 




0 Upper Bound 


0.35 

—X— Hybrid Coding 



- - Grouped Random Coding 



—1— Conventional Random Coding 

1 ! 1 


0 0.1 0.2 0.3 0.4 0.5 


c 



(b) /ii2 = /T21 = b, fiis = /T31 = and /T23 = M32 = | 



Fig. 6. Inner and outer bounds on the symmetric capacity of example 3-user BIC problems: (a) One-sided side information symmetry, and (b) paimise side 
information symmetry, while varying the least knowledgeable user’s side information under; and (c) one-sided side information symmetry, and (d) pairwise 
side information symmetry, while varying the most knowledgeable user’s side information. 


With the figures, we highlight the following observations about our inner and outer bounds: 
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1 ) 


2 ) 

3) 


There exists a threshold for side information parameters where below this threshold, in the best hybrid coding strategy all 
three users decode all messages and thus the achieved rate is the same as conventional random codes. In particular, this is 
true for small a and b in Figures 6 (a) and 6 (b) and larger c and d in Figures 6 (c) and 6 (d) respectively. However, beyond 
this threshold (larger a and b and smaller c and d), we observe a clear potential for increased rate from hybrid codes. 
It is worth noting that the regimes where hybrid codes offer a rate increase are those further from the fully symmetric 
BIC problem (where all network parameters, /i^, are the same). Recall that for the fully symmetric BIC problem the 
entire capacity region is achievable using conventional random coding (see Proposition |^. 

In Figures 6 (a) and 6 (b)| when a = 1 or 5 = 1 there exist no opportunities at all to exploit the side information at User 1. 
Hence, both hybrid coding and grouped random coding achieve the genie upper bound. 

Although there exists a gap between our inner and outer bounds, we highlight a specific case where our new hybrid 
coding scheme both provides strictly positive rate gain over conventional random coding and meets the new upper bound: 
in Figure | 6 (d) when d = 0. This scenario is related to the one considered in the motivating example of Section III in 
the sense that Users 2 and 3 know each other’s complete message as side information. 


VII. Blind Index Coding over Wireless Channels 

In this section, we generalize the BIC problem model further to consider the impact of uncertainty not only within the side 
information given to users, but also in the sender-to-user broadcast channel (recall that in the BIC problem this channel was 
error free). In particular, we emulate loss of packetized transmissions due to fading in wireless channels using a binary fading 
model for the sender-to-user broadcast. Consequently, the problem considered here will be referred to as blind index coding 
over wireless channels (BICW). 

As we will see, considering wireless transmissions adds new challenges to the problem, and surprisingly repetition of uncoded 
bits (within the hybrid coding framework) will become a powerful technique for increasing achievable rate. Unlike the BIC 
problem considered in the previous sections, even the 2-user BICW problem is nontrivial. Hence, in this section we focus on 
a a 2-user problem representative of general BICW problems. After formally defining the representative problem, we define a 
hybrid coding scheme that not only XORs randomc combinations of some messages with uncoded bits of others, but also uses 
repetition of uncoded bits. We derive the achievable rate regions of these hybrid codes with repetitions, and then denomstrate 
numerically the resulting gain in achievable rate that our scheme provides over conventional methods. 


A. Wireless Broadcast Channel Model 

In the BICW scenario the channel output received by by User i, iji, is governed by a binary fading process. Specifically, let 
7 i be a binary vector with the same length as the channel input vector x and drawn i.i.d from a Bernoulli(l — e^) distribution. 
The channel output for User i is given by the input-output relationship 

y^[i] = (35) 

User i knows 7 ^, however the sender is only aware of parameters {ci}, which govern the probabilistic behavior of the sender- 
to-user broadcast channel. 

In this section, we assume the model depicted in Figure containing only two users where ei < £ 2 , /ii 2 = 1, and p, 2 i = p. 
(i.e.. User 1 has a better channel than User 2 but no side information). 



Fig. 7. 2-user instance of the BICW problem. 


Remark 15. We assume that ei < 62 and that side information was only given to User 2 (i.e., p 2 i = 1) for ease of exposition. 
In all other 2-user settings (i.e., arbitrary ei and £2 and side information at either user), either there is no index coding gain 
even if the sender knows the side information or the natural generalization of our proposed scheme recovers some index coding 
gain to outperform conventional approaches. 

Our main result for this setting is as follows. 

Theorem 7. For the 2-user BICW problem defined above, the rate region TZ is achievable, where TZ is the set of all non-negative 
rate pairs {ri,r 2 ) satisfying. 


O + ^2 < 1 - £ 1 , 


( 36 ) 
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u}i{L)ri + LU2{L)r2 < 1 - e2, L = 1,...,L^ 


(37) 


where 


wi(7v) — 


1 h 


1 - ei 
uj 2 {L) = min 


ei + fl{l — 62 )uJ 2 {L) + L (1 — 62 ) (1 — UJ 2 {L)) , 


1 

L ’ ^ r > 


log(M) 


(38) 

(39) 

Llog(ei/e2)J ■ 

Remark 16. Notice that as 62 —> 0 (and by the assumption 62 > ei, as ei —0), the BICW problem reverts to a BIC problem. 
Moreover as 62 0, uJi{L) —>■ /r and l^ 2 {L) 1, resulting in the achievable region of rate pairs satisfying; 


ri + r2 < 1, 
ftri + r2 < 1, 

which is equivalent (given assumptions on /ii 2 and /i 2 i) to the 2-user BIC capacity region (formally stated in Proposition]^. 


B. Proof of Theorem 

This section is organized as follows. We first define the hybrid coding scheme by specifying a class of generator matrices 
which map length-m message vectors to length-n codewords, and which are parametrized by three quantities: p, L, and a. For 
each n, the transmitter maps two messages, wi and W 2 to codewords using corresponding generator matrices (with different 
parameters), and XORs the two codewords to produce the channel input vector. We then specify the method of decoding and 
establish the achievable rate region for our coding scheme when fixing the generator matrix parameters for all n. By doing 
so, we show that for any (ri,r 2 ) £ TZ (as defined in Theorem there exists a choice of parameters such that (ri,r 2 ) is 
achievable, thus proving Theorem]^ 

1) Encoding: Our hybrid coding scheme encodes wi and W 2 separately and linearly, before combining the resulting 
codewords through bit-wise XOR. The codeword for each message is constructed in a manner similar to the component 
of BIC hybrid codes from the previous section specific to a single message component: uncoded repetitions of message bits 
are supplemented by a random linear combinations. The specific mapping from message to codeword is formalized in the 
following definition, parametrized for a given n by three quantities p, L and a: 

X m RRP matrix with parameters p £ [0,1], L £ N, and 

Al+i 0]^, (41) 

if£<L 
else ’ 

and B is a pn X m matrix with entries drawn i.i.d. from Bernoulli (|). For feasibility, we require that am is an integer, and 

772 

(L + a)-h p < 1. (42) 

n 

Remark 17. Simply stated, an RRP matrix maps a length-m message vector to a length-n codeword by repeating each uncoded 
message bit either L or L + 1 times. The parameter a specifies the fraction of bits repeated L + 1 times, while p specifies the 
proportion of length-n codeword reserved for random linear coded parity. Inequality ( |42| i ensures that U is an n x m matrix. 

It is worth noting that in the hybrid encoding scheme described for 3-user (non-wireless) BIC, the mapping of message wi, 
W 2 , and W 3 to sequences before XOR (i.e., the individually colored bars in Figure]^ could be interpreted as RRP matrices. 
For wi, we chose L = a = 0 and for messages W 2 and W 3 we chose L = 1 and a = 0. The use of RRP matrices with L > 1 
and a > 0 (i.e., the repetition of uncoded message bits) is the key innovation to hybrid coding that enables higher rate in the 
wireless setting. 

Using the defined RRP matrices, we now describe the encoding scheme that maps messages wi and W 2 to a length-n channel 

input vector. Let n, m$^"\ and m^"^ be given. For each n, let Ui be a n x m^"^ RRP matrix with parameters (pi, Li, ai) and 

U 2 be a n X m^"^ RRP matrix with parameters (p 2 ,T 2 ,Q; 2 )- The channel input vector x" is given by (assuming modulo-2 
addition): 

x" = [Ui U 2 ] 


Wi 

W 2 


Definition 2 (Repetition plus Random Parity (RRP) Matrix). An n 
a £ [0,1] is a binary matrix, U, with the form: 


U= IRT at ... 


where 


A, = 


0 ] 
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ai = 0.5 


-pi = 0.25- 


-Li = 2- 






Wi 


-mi - 









- n -► 


Fig. 8. An example hybrid coding scheme for the 2-user BICW setting, where (pi, Li, ai) = (0.25, 2, 0.5), and (p 2 , L 2 , 0 : 2 ) 
represent uncoded bits, shaded boxes represent RLCs of a single message. 


(1, 0,0). Outlined boxes 


= UiWi + U2W2- 


Figure depicts an example hybrid encoding with repetitions for the 2-user BICW setting. In this particular example, L = 2 
and a = 0.5. 

2) Decoding: We now specify the decoding strategy and then characterize the achievable rates for our scheme with fixed 
parameters pi, Li and at, i = 1, 2. In what follows, we choose (p 2 , 012 ) = (1, 0, 0) (i.e.. User 2’s generator matrix, U 2 , is 

a random matrix). Choosing parameters (pi,Li,ai) is more nuanced and will be addressed within the analysis. For brevity, 
we will not explicitly analyze the error rates of our scheme for given n, but instead provide a sketch of the achievability proof 
using existing results for random linear codes over point-to-point erasure channels. 

In our decoding strategy. User 1 first decodes W 2 and peels its interfering contribution from its received signal, and then 
decodes its desired message, wi. User 2 only decodes W 2 - We first describe decoding W 2 at each user. 

Recall that the channel input at any time, t, is given by x[f] = Ui(f, :)zui + \J 2 {t,:)w 2 , where Ui{t,:) is the <-th row 
of generator matrix U^. The decoding strategy for W 2 used by both users is based on the following observation. If t 
and t' ^ t both correspond to a repetition of the same message bit from rui, then the modulo-2 sum of these yields 
x[f] -f x[f'] = (U 2 (f,:) + U 2 (f^ ■))w 2 , which is a random linear combination of only W 2 bits (since p 2 = 1). By this method 
we “clean” equations of wi. User 2 has the additional option of using its side information to clean equations, which has the 
same essence. 

The cleaned random linear equations are used by each user in conjunction with those that by construction were only functions 


of W 2 (i.e., for those t where in (41 1 Ui(f,:) = 0) to decode W 2 - After decoding W 2 , User 1 removes the contribution of 


W 2 from its received signal before decoding wi. If any of these decodings fail, then an error occurs. We now claim that the 
decoding scheme yields the following achievable rates, proven in Appendix [P] 

Lemma 8. Consider the 2-user BWIC problem defined by network parameters ei, € 2 , and p, and let pi S [0, 1], Li € N, and 
ai € [0,1) be fixed. A rate pair (ri,r 2 ) is achievable if it satisfies, 

I- Pi 


ri < 


Li ai 


ri < pi- 


1 - ei 


[l-e[ 

4 


//[I — € 2 ^ + 0;i(e. 


/ Li\ 

aiUi 

Li 

2 


— e 


Li + l 
Li + 1 


- ai(ei' - 
)]ri +r 2 < (1 - ei)(l - pi), 
)]ri -t- r2 < (1 - e2)(l - Pi). 


ii+i 


)’ 


(43) 

(44) 

(45) 

(46) 


From Lemma 1^ it is clear that by considering the union or achievable rate pairs over all {pi, Li,ai) we arrive at the rate 
region achievable by our schemes. Specifically, let TZ{pi, Li,ai) for pi G [0,1], Li G N, and ai G [0,1] be defined as the set 


of all pairs (ri,r 2 ) satisfying (43 h46i, and we define a rate region: 


U 7Z(pi,Li,ai). (47) 

To complete the proof of Theorem |7] we now demonstrate that the region 7Z (as defined in Theorem is contained within 
7Z (given in (47 1 ), and thus is achievable. To do so, we need only show that for every rate pair (ri,r 2 ) G TZ, there exists 
parameters {pi,Li,ai) such that (j4^-(j4^ are satisfied. We therefore fix ri to any value in the interval [0,1 — ei], and choose 
parameters pi, L*, and a* as 


L* = maximize min{L,Lmax} 
subject to L gN 

<1- Lri 


(48) 


0 

i-’-i 


if r * _ r 
11 ±j^ — J-'max 


if LI < L„ 


( 49 ) 
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Pi = 


^ / L-t 

- a*(e/ - e 


+1 N 
1 . 


1 - ei 




(50) 


where L^ax is as defined in (40 1 . Notice that given ri, we first determine the appropriate L\, then a\, then finally p\ and 


that both ( |4?| and (44i are satisfied by the chosen parameters. 

Substituting these into (451 and ( |46| ), we see that r 2 is achievable if it satisfies both of the following inequalities: 

r2 < (1 - ei) - pI{ 1 - ei) - ri 1 - 

= 1 - ei - ri, 


(51) 


r2 < (1 - £ 2 ) - Pt(l - £ 2 ) - rip 


1-62* +at(e 2 ^ -£2^+' 


= (1 - £2) 1^1 - ri 

(a) 1 - £2 - UJijLDn 
012 (f^l) 


1 - £1 


+ P- 


1 - £2 


(4= 


LI 


(52) 


where in (a) we compared the evaluated expression with a;i(L) and ui 2 {L) as defined in (38 1 and (391 evaluated at Li = L\. 


We now point out that 0 is equivalent to ( [3^ and ( [5^ is equivalent to <0 evaluated at L = Moreover, since 


1 - £2 - a;i(L*)ri . 1 - £2 - a;i(L)ri 

-^——- > mm -^—r-. 

L 022 (i) 


(53) 


and the right hand side of inequality represents the tightest version of (37 1 for fixed ri, we observe that any (ri, r 2 ) satisfying 
(36 1 and (37 1 for all L < Lmax (i-e., any (ri,r 2 ) S TZ) is indeed achievable, thus completing the proof of Theorem]^ 


C. Numerical Results 

For blind index coding over wireless channels, we recall that the key difference was the usefulness of repeating uncoded 
bits within the hybrid coding scheme. Therefore, we now provide numerical results for three BlCW scenarios, characterized by 
£ 1 , £ 2 , and /i. In each, we plot TZ and highlight regimes (along x-axes) wherein the number of repetitions used in our scheme 
increases. For each scenario, we point out the gain in r 2 offered by repetion-based hybrid codes over conventional schemes, 
and for further comparison we also depict rate regions achieved by: 1) Conventional random codes as defined in the beginning 


of Section IV-B 2) Time-Division between separate random encoding of wi and W 2 , and 3) the following genie-aided upper 
bound: 


Proposition 9. For the 2-user BICWproblem setting considered in Theorem^ an achievable rate pair (ri,r 2 ) must satisfy 

<l-£i. (54) 


max < ri -I- r 2 , p,ri - -r 2 

1 - £2 

Proof: The bound may be separated into two outer bounds that correspond to the first and second terms within the max, 
respectively: 

• ri -f r2 < 1 - £ 1 , 

• -f r 2 < 1 - £ 2 - 

Denote the subvector of wi given as side information as w'^ and the complementary subvector as wf. We prove the first 
bound by applying Fano’s inequality at each user to observe: 

nri < I (yi,7i;u;i) -f o{n) 

= I {yi,li\wi) + o{n) 

= H {yi\ji) - H (j 7 i| 7 i, wi) -f o(n) 

(a) 

< ff (yilji) - (y2l72,wi) -h o(n) 

< H (yilfi) - H (fj 2 \l 2 ,wt) +o{n) 

<n{l- ti) - H [y 2 \ 72 ,w^) + o(n), (55) 

nr2 < I (^2, 72, <?2i, 521:^2) + o{n) 

= H (2/2172, <? 21 ,521) - H (2/2172, <? 21 , 521,^2 

< H (2/2172, <?2i,ff2i) +o{n) 

= H {y 2 \ 72 ,wf) +o{n), 


) + o{n) 


(56) 
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where in step (a) we observed that because the sender does not know the fading channel state of the sender-to-user channel. 
We complete the proof of the first bound by combining ( [55] l and and normalizing by n as n grows large. 

To prove the second bound, we consider a genie which provides the sender of knowledge regarding which bits of wi are 
given as side information to User 2. We again applying Fano’s inequality at each user, but in a different way, to observe 


nfiri < I + o(n) 

< I {yi,'ii,wt,W2;wY) + o(n) 

< H {yi\'yi,w^,W2) + o{n), ( 57 ) 

nr2 < I {y2,l2,$2i,g2i,w-^ +o{n) 

= H (1/2172, <? 21 , 52 i) - H (1/2172, <?2i, 521,1^2) +o{n) 

< n{l - €2) - H {y2\j2,wt,W2) + o{n), 

{b) ^ 

< n(l - 62) - u- H {yi\^i,w^,W2) +o(n), ( 58 ) 

1 — 62 

where in step (b) we applied Lemma 1 of 021 which when applied to our problem states that (because the sender does not 
know the binary fading channel states {7^}), 


H {y2\l2,wt ,W2) > 


1 - ei 
1-62 


H {yi\7i,wt,W2) ■ 


To complete the proof of the second outer bound, we scale (57 1 by and combine with (58 1 . ■ 

In Figure 9(a) ei = i, £2 = |, /r = | notice that when ri is near the point-to-point capacity of 0.5, hybrid coding recovers 
all of the available index coding gain. This is because when ri is near 0.5, the primary challenge is not blindly exploiting side 
information, but rather accounting for interference incurred at User 1. For this set of network parameters, we point out that 
for any fixe d valu e of ri, hybrid coding offers at least 62% of the available index coding gain. 

In Figure 9(b) ei = i, £2 = ^, /i = ^ we consider a BICW setting where side information is plentiful (User 2 knows 
90% of iFi). In this case, L^ax = 4 and the piece-wise linear boundary of the hybrid coding achievable rate region has more 
linear segments, with segments corresponding to the number of repetitions used. For this setting and for any fixed ri, HRC 
always achieves at l east 68 % of the available index coding gain. 

Finally, in Figure 9(c) £1 = 4, £2 = |, /i = we consider a BICW setting with very little side information (User 2 knows 
10% of wi). In this case, L^ax = 1 and from the figure, it is apparent that although any index coding gain is modest, it is 
still strictly positive for all ri ^ { 0,1 — £ 1 }. 

From the scenarios depicted i Figure we make the following unifying conclusions: 


1) Regardless of the network parameters (£ 1 , £ 2 , and fi) hybrid coding always increases the achievable rate region. 

2) If we consider a fixed ri, the number of repetitions used in the hybrid encoding scheme increases when User 2 has a 
weaker channel and more side information (i.e., £2 grows larger and y, grows smaller). 

3) Hybrid coding can be capacity achieving, as seen on the boundary of the rate regions in all three figures when ri is 
close to it’s maximum. 


VIII. Concluding Remarks 

In this paper, we introduced a generalization of index coding called blind index coding, which captures key issues in 
distributed caching and wireless settings. We demonstrated that the BIC problem introduces novel and interesting challenges 
that require new analytical tools through three main contributions: 1 ) we proposed a class of hybrid coding schemes which 
mix uncoded bits of a subset of messages with randomly linear combinations of other messages, 2 ) we presented new outer 
bounds that leveraged a lemma based on a strong data processing to capture the lack of knowledge at the sender, and 3) we 
demonstrated that in scenarios where the sender-to-user channel is not error-free (specifically, a wireless binary fading channel) 
repetition of uncoded bits within hybrid codes can further increase the achievable rate. 

To further emphasize the importance of analyzing BIC problems, we refer the reader Figure [T^ which depicts the setting 
considered Q, which itself was a specific case in the broader class of multiple unicast and multiple multicast problems in 
wireless erasure networks 1201 . Such problems consider the communication of multiple distinct messages to different users in 
a wireless network over probabilistic lossy links. 

A key contribution of 0 was the revelation that it was strictly suboptimal for relays within such a network to apply 
conventional random network coding. Instead, relays imparted structure into their network coded transmissions by XORing 
unmixed received bits of one message with random combinations of another; i.e., relays applied a version of hybrid coding 
to their received signals in order to outperform conventional random network codes. From results presented in this work, one 
arrives at such a relaying strategy naturally. From the point of view of either relay, the transmissions of the other relay are 
side information, and more importantly, due to the lossy nature of links the relay is blind as to what side information was 
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Fig. 9. Rate regions achieved by different schemes — Conventional random codes (blue), time-division between separate random codes (green), hybrid 
coding (red), and genie-aided (non-blind) index coding (white) — for three different 2-user BICW problems. The number of repetitions used in the hybrid 
coding scheme is stated along the x-axis. (a) For this setting, Lmax = 2; (b) For this setting, Lmax = 4 and we have emphasized using dashed lines bounds 
j36^ and j37^ for all L that comprise the boundary of 7?.; (c) For this setting, Lmax = 1 and notice even with very little side information, our hybrid coding 
scheme strictly outperforms conventional schemes. 


provided. Additionally, the transmission model from relays to destinations matches precisely the lossy sender-to-user broadcast 
considered in Section IVIII 

It is important to point out that the BIC and BICW problems in the general setting remains an open problem. Therefore, to 
conclude the paper we revisit one class of interesting symmetric side information BIC problems (from Section [Vl] l that remains 
unsolved and yet offers a simple and concrete enough case for progress to be made, potentially revealing new insights. Consider 
the following 3-user BIC scenario when side information parameters are pairwise symmetric: /ii 2 = ^21 = a, 7^13 = = b, 

7^23 = 7232 = c with a > b > c (for a concrete example we refer the reader to Figure 6(d) 1 . From Theorem we hnd the 






















Fig. 10. The symmetric two-hop erasure broadcast channel from O with focus on the embedded BICW problem seen by Relay 1. The network consists of 
two hops of communication. The first hop is an erasure broadcast channel, whereas the second consists of two parallel, non-interfering erasure broadcasts. 
Destination 1 wants message wi and Destination 2 wants but with no knowledge of erasures, Relay 1 is unaware of the (side) information provided by 
Relay 2. 


symmetric achievable rate: 


= max 


1 


1 


l + a + 6 + c — a6’ l + a + 6 


and from Theorem]^ we have the capacity bound: 


^sym — 


\ d b — 


(a —c) (&—c) 


(59) 


(60) 


Notice first that, as in the numerical example of Figure 6(d) if c = 0 or c = 6 the upper bound is tight and capacity is 


achieved. However, within the interval c S (0, b) there exists a gap between achievability and converse. 

Additionally, recall that the first quantity in the max of ( [59] l is the rate achieved by hybrid coding and the second is by 
conventional random coding. Clearly, hybrid coding provides a rate gain when c < ab. This regime is one where the side 
information Users 2 and 3 have about each others’ messages is large and thus Phases 1 and 2 in Figure are small. Our 
hybrid coding assumes that User 1 ignores these phases, but when they are larger (i.e., as c grows) these transmissions may 
be used by User 1 to decode messages W 2 and w^. In particular, the case where c = ab {a point notably within the interval 
(0, b)) represents a threshold where the structure of our hybrid code can no longer expect to hide linear subspaces of W 2 and 
W 3 from User 1. 

We conjecture that at this threshold, any method of encoding W 2 and that satisfies the decodability condition at Users 2 
and 3 also allows User 1 to decode W 2 and (i.e., at this threshold it is the converse and not achievable scheme that may 
be tightened). 
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Appendix 

A. Evaluating , g'); (fij, fij)) 

We now prove that ,(f)\ {4>ij-,gij)) = by showing that it may be bounded both from above and below by the 

same value. We hrst address the upper bound: 

max 

OL /3 

< s*{(f'[l],g'[l])-{fij[l],gij[l])). (61) 

In step (a) we apply the tensorization property of s*(- ) m, and in (b) we observed that all variables are i.i.d. across 
£. To simplify exposition, we now use the following notation; Let Pa{-) and Ppf ) denote probability mass functions for 
(^'[ 1 ], p'[l]) and {(j)ij[l], gij[l]) respectively, and let Qa{-) and Q/sf) be arbitrary probability mass functions for (^'[ 1 ], ^'[l]) 
and (fijW], gij[l]) respectively. Note that the support of both ((^'[1],and [1], [1]) is {(0,0), (0,1), (1,1)}. Using 

this notation, we now observe 


qJp^D(Q^\\P^) 


= sup 




-f P/3(l,l)l0g 


Q/ 3 ( 0 , 0 ) 


Q/3(i,i) 


/D(Q^\\P^) 


= sup 


(<5 + (1 - <5)P„(0, 0)) log ( + (1 - 1) log 


+ (1 - <5)Ua(l,l)log 


5 -f (1 — (5)(5 q(0, 0) 
(1-5)P„(1,1) 


( 1 -^)Q„( 1 , 1 ) 


/D(Q„||P„ 


< sup 

Qc/P. 




+ (1 - <5)Ua(l,l)l0g 


(l-^)Qa(0,0) 

(1-5)P„(1,1) 


(1-5)Q„(1,1) 


/D(Q,||P„ 


( 62 ) 
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Qct^Pa ^\Qa\\Paj 1 ^ 

Step (c) is verified by observing the following. Trivially we have 

'<5 + (l-^)P„(0,0) 


Since for any positive y, U, and V: 


< max (y + (1 - S)Fa(0,0)) log 


ye [0.1] 


y+(l-(5)P„(0.0) 
2/ + (1-5 )Qo(0,0) 


(63) 


(64) 


^(y + U) log 


yJ-J7 

y + V 


= 1 + log 

< 0 , 


y + U\ y + U 


y + U J y + V 


we observe by letting U = (1 — d)Pa(0, 0) and V = (1 — S)Qa(0, 0) that the right hand side of (64i attains its maximum at 

y = ^- ^ ^ 

To show that s*{{(j)',^); ((j>ij,gij)) > , we restrict the domain of Qa to mass functions where (5ct(0, 0) = Po,(0, 0) 


and observe from 

((5 + (1 - i5)P„(0, 0 ))log 


= sup 

Qc^Pc 


S+(l-S)Pa(0,0) 


+ (l-5)P„(l,l)log 


> sup 

QcT^Po 

Q„(0,0)=Po(0.0) 


i5 + (1 — S)Qa(0, 0) 

(1-(5)P„(0,1) 


+ (l-,5)P„(0,l)log 


(l-,5)P„(0,l)log 


(1-^)Q„(0,1) 


+ (l-,5)P„(l,l)log 


(l-S)Pa(0,l) 

(l-6)Qa(0,l) 


(l-S)Pa(l,l) 

il-S)Qa{l,l) 


/D{Qa\\Pa) 


/I r\^(Qa|l'fa) w r 1 

aS?.. = = T377' 

Q„(0,0)=Po(0.0) 


(65) 


Remark 18. Note that the validity of Lemma only requires the upper bound (63 1 . However, by evaluating the lower bound 
(65 I as well, we may confirm the exact value of s*{{(j)',^); (^ij,gij)). This value has an intuitive interpretation as the success 
probability of the channel that takes each bit of the virtual signal (j)' as input and gives (pij as output. 


B. Proof of 

If Hkj > we observe that Wi and (4>ij,gij) are statistically enhanced versions of (4>ki,gki) and (<f>kj,gkj) respectively. 
We may further enhance the side information of User k by also providing {(pikjfik)- Applying Fano’s inequality at User k 
with side information enhancement, we find 


nrk < I{x,(j)ki,gki,(l)kj,gkj]Wk) + o(n) 

< I{x,Wi,4>ij,gij,(j)ik,gik',Wk) + o(n) 

= lifik.giklWk) + H(x\wi,(l)ij,gij,(l)ik,gik) - H{x\wi,fij,gij,Wk) + o(n) 

= n(l - yik)rk + H(x\wi,(j)ij,gij,fik,grk) - H{x\wi,(j)ij,gij,Wk) + o(n) 

(“) 

< n(l - yik)rk + H(x\wi,(l)ij,gij,(l)ik,gtk) - ^J,^JH(x\wi,Wk) + o{n), (66) 


where in step (a) we applied (15i from Lem ma by letting V = (wi,Wk)- 

of (| 66 P and 


By combining ( [2T] ) and scaled versions of (|66|l and ( [26| , and taking the limit as n grows large, we arrive at ( [20| . Similarly, 
if Hjk > Pik we may switch the roles of Users j and k in the above analyses to arrive at a similar conclusion. 


C. Proof of Theorem 

We now formally prove Theorem The following notation and claim will simplify exposition of the proof. Let be a 
length-ruj vector of i.i.d. Bernoulli random variables that take a value of 0 with probability 77 , and let 

Notice, for instance, that is statistically equivalent to (fij,gij), and that -ifj and ifj are equal to (wj,!) and ((5,0) 

respectively. Additionally, we define = (4'^J^\(f''^^)- We now formalize the notion of statistically enhanced side information, 
with the following claim, which is consequence of the sender being blind to the precise side information; 
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Claim 10. Let 771 > 772 be given. For any k G {1,..., K} and V independent of Wj, and we have 






and 


,V). 


XV2 


( 67 ) 


( 68 ) 


Proof: The proof is an immediate consequence of the sender being blind to the side information channels: since each 


quantity is a function only of the marginal distribution of we may in fact define ^ physically degraded version 

of Hence, the right side of (67i can be seen as the mutual information between a message and an enhanced channel 

output, and the right side of (681 caiTne seen as an enhance signal adding conditioning. ■ 

The proof now proceeds as follows. At each node (£, i) in the OBT with i < K, we will apply Fano’s inequality to a virtual 
user that desires message This virtual user is given side information signals that are statistically enhanced versions 

of {(l)v[e,i],j} (i- 6 - the actual user in the BIC problem). The statistical properties of the side information channels governed 
by {r]j[i,i]}j as defined by the OBT structure and (31 1 . We will see that by applying Lemmato the expansion of Fano’s 
inequality at each node, and scaling the resulting scaling expressions according, that terms on the right hand side of each 
expression will cancel and we will arrive at the stated bound. 

We denote the complete collection of side information and channel state information given to a virtual user represented 
by the i-th node in level £ of the OBT as Similarly, we denote the collection of side 

information/channel state given to the actual User v[£,i] as 

If i is odd, recall from (31 1 that 77 „[f ,][£, i] = 1 which implies that none of the virtual user’s desired message is provided as 
(enhanced) side information. Thus, for off i and £ < K — 1 we have 

nry[t,i\ < I{x, + o{n) 

< I{x, 'i>[£,i];wyie^i]) + o{n) 

= H{x\'^[£,i]) - H(x\'i[£,i],Wyie^i]) + o{n) 

< H{x\^[£, i]) - C,[£, i]H{x\%£ + 1,- 1]) - (1 - ([£, t])H{x\^[£ + 1, 2i]) + o{n), (69) 

where in step (a) we applied a combination of Lemmaand observing from (31 1 that ^[£ + l,2i — 1] or ^\£+\,2i] can only 
increase conditioning relative to i], 

If i is even, some of the virtual user’s desired message may have been provided as side information. Thus, for even i and 
£< K — 1 we have 

+ o(n) 

< + oin) 

= + H{x\^[£,i\) - H{x\^[£,i],Wy[i^i^^) + o{n) 

= n{l - Vv[e,i][£,i])ry[e,i] +H{x\^[£,i]) - H{x\^[£,i],w.^[g^,]) + o{n) 


tm 


W ( 1 

— 77 ^ 1 Xlv[£,i] 

ic) 

< 77 1 - 77 „ 


£- 1 , 


£- 1 , 


rvie,i] + H{x\'£>[£,i]) - H{x\'i>[£,i],Wyie^i]) + o{n) 
rv[£,i] + H{x\-^[£,i]) 


- C[£, i]H{x\^[£ +l,2i- 1]) - (1 - C[A i])H{x\'^[£ + 1, 2i]),+o{n). 


(70) 


In (b) we applied (311, and in (c) (a) we applied a combination of Lemmaand observed from (31 1 that \i/[f + 1,27 — 1] or 
+ 1 , 27 ] can only increase conditioning relative to f>[£,i],Wyii^ij. 

We now address the base cases (i.e., when £ = K — 1). We first observe that at the K — 1 level, because all paths 
from root to leaf are permutation of all user indices, if j ^ {v[K — \,i\,v[K,i]] then rjj[K — \,i] = 0. Equivalently, if 
j i {v[K - l,i],v[K,i]} then 

Now recall that if i is odd, then r]v[K-i,i][K — I,?] = 1, and from Fano we have 

^^v[K—l^i] — ^T ^v[K—l,i]i'^v[K—l,i]) “t“ 0 ( 77 ) 

< I{x,^[K - l,i];WyiK-i,i]) + 0 ( 77 ) 

= Hixl^K - l,i]) - + o{n) 

id) 

< H{x\'^[K - 1,?]) - ri^iK,i][K - l,*]77(^|{wj}j/„[K,j]) + 0 ( 77 ), 


( 71 ) 
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where in step (d) we applied If i is even, then — l,i] can be less than 1, and from Fano we have 

^v[K'^v[Ko(n^ 

< I{x,'^[K - + o{n) 

= - 1,*]) - + o{n) 

= n{l - Vv[K-l,^][K - + H{x\^[K - l,i]) - {wj}j^yiK,t]) + o{n) 


^ I 1 Vv[K—l,i] 
(e) 

^ ^ I 1 Vv[K—l,i] 


K-2, 

K-2. 


rv[i,i] +H{x\^[K - 1,2]) - + o{n) 


rv[K-i,i] +H{x\'^[K - l,i]) - 'ny[K,i][K - l,i]H{x\{wj}j^y[K,i]) + o(n), 

(72) 

where in (e) we applied 

Finally, at the iF-th level of the OBT we have trivially 

nryiK,i] < I{x, ^v[K,i\-,Wy^K,i]) + o{n) 

< I{x,{wj}j^y[K,iy,Wy[K,i]) + o{n) 

< H{x\{wj}j^y[K,i\) + o{n). (73) 


Using (69i-(73i, and scaling expressions according the the coefficients i] at each node, we recursively arrive at 

nF^]!, 1] < H{x\^y[i^i\) + o(n) 

< n + o(n), 

as desired. □ 

D. Proof of Lemma 


Consider (pi,Li,ai) fixed and rate pair (ri,r 2 ) such that (43l-(46l are satisfied. To prove the lemma, we demonstrate that 
there exists a scheme and that using this scheme (ri,r 2 ) is achievable according to Definition To do so, we first show 
that if (43 I is satisfied, then a sequence of Ui matrices (and by proxy encoding functions) exists. We then argue that using 
the described encoding and decoding strategies with (p 2 ,U 2 ,Qf 2 ) = (1,0,0), the usual equation counting argument applied to 
conventional random codes in erasure channels suffices to prove that error probability vanishes as n ^ oo. 

If (43 I is satisfied, then for every n, we may choose = [nrij, thereby satisfying (421 and guaranteeing the existence of 

a RRP matrix for wi. Furthermore, we see that lim„_,,oo = Xi. Bearing this in mind, we let = [nr 2 \ for each n and 
consider the scheme that uses RRP matrice with parameters {pi,Li,ai) and {p 2 , L 2 ,a 2 ) to encode wi and W 2 respectively. 

Recall that in our decoding strategy, W 2 is decoded at each user by first extracting clean equations of W 2 (where by clean, 
we mean the contribution of wi can be canceled out as explained previously). Then, the next step is decoding W 2 from the 
clean equations. We now calculate the probability that at time t User i, where 2 = 1,2, can extract a clean equation of W 2 for 
three cases: 


• If f = (f — 1 ) 772 $^”^ + k, where i is an positive integer with i < Li and 1 < fc < then Ui(f, :)2Z;i is the f-th 

repetition of 2Zli[fc]. The probability of receiving a clean equation at User 1 is r]i(t) = (1 — ei)(l — and at User 2 is 

V2{t) = (1 ~ e2){l — 7tC2~^)> which are the products of probabilities that User i at time t receives an unerased transmission 
and has previously received a transmission (or side information in at User 2 ) containing the same 2Z?i[A:]. 

. If < t < + Pin, then Ui(f, :)2(;i is random combination of wi bits, and 271(f) = 272(f) = 0 (i.e., to 

decode W2, each user ignores these transmissions). 

• If Li 272 ^"^ + Pin < f, then Ui(f, :)wi = 0 , and the probability of a clean equation is the probability of an unerased 
transmission: 271(f) = 1 — ei and 272(f) = 1 — £2. 

From capacity analysis of point-to-point erasure channels, we note that a random linear coded message, m2, with rate r2 is 
decodable at User 1 with arbitrarily low probability of error as 27 —00 if the number of received random linear equations of 
W2 is sufficiently large. Specifically, by counting the number of clean random linear equations of W2 received by User 1 , we 
see that r2 must satisfy: 

1 " 

r2 < lim 


(n) 


>00 27 




= lim — 

n—>-oo Ti 


(aim)"* Li-l-1 

^ £(l-ei)(l-eri) + 

\ fc=i e=i 




y](l - ei)(l - Cl ^)-f (n - Li 277P - aimp - Pi27)(l - ei) 


-\-l 
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= (1 - ei)(l - pi) - ri 1 - + ai(ef^- 


ii+i 


(74) 


Through analogous analysis, we find that communicating of W 2 to User 2 is possible with arbitrarily low error probability if 


r 2 < (1 - e 2 )(l - pi) - ri/i 1 - ^ + aiie^^ - 62 ^+^) 


(75) 


Notice that (74 1 , (75i are equivalent to ( |4^ , ( |46| and thus if both expressions are satisfied, then W 2 is decodable at each user 
with high probability. 

We now address achievability of ri assuming that User 1 has already successfully decoded and canceled W 2 from its received 
signal. It is sufficient to show that ri satisfying any one of ([4^-(|46]l is achievable, and we do so by proving achievability 
of Ti satisfying (44 1 . Observe that the repetition portion of the RRP matrix supplies through the erasure channel a subset of 
message bits to User 1. Even if the repetition-code-supplied bits are removed, note that the random linear code portion of the 
RRP matrix still represents a random linear code applied to bits unknown after repetition. To decode, the total number of bits 
received though repetition and linearly independent equations received must be equal to 


(n) 


Using Hoeffding’s inequality lISTIl . one can show that with high probability as n 
repetition coding is concentrated around its mean. 


00 the number of bits received through 


E 

/c=l 


Li + 1 


E ' 




( Li ( Li 

[^1 )]■ 


^1 — -I- bits remain to be communicated using 


Therefore, after the repetition phase, approximately m\ 
the random linear coding phase. 

Through the usual argument that random linear combinations are independent w.h.p. as n — 00 , the random coding portion of 
the scheme supplies approximately pin(l—ei) equations. Therefore, as n — c», we may expect the random linear coding portion 
to resolve all message bits of tui that were not received during the repetition phase if ^1 — < 

□ 




pin(l — ei), or simply a rate ri is achievable if it satisfies ri = lim„_>.oo < Pi 


L-\ r 










